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Abstract — In this paper, we look at the problem of finding 
the tour of maximum reward on an undirected graph where 
the reward is a submodular function, that has a curvature 
of k, of the edges in the tour. This problem is known to 
be NP-hard. We analyze two simple algorithms for finding 
an approximate solution. Both algorithms require 0(|V| 3 ) 
oracle calls to the submodular function. The approximation 



factors are shown to be 



and max 



respectively; so the second method has better bounds for low 
values of k. We also look at how these algorithms perform 
for a directed graph and investigate a method to consider 
edge costs in addition to rewards. The problem has direct 
applications in monitoring an environment using autonomous 
mobile sensors where the sensing reward depends on the path 
taken. We provide simulation results to empirically evaluate the 
performance of the algorithms. 

I. Introduction 

The maximum weight Hamiltonian cycle is a classic prob- 
lem in combinatorial optimization. It consists of finding a 
cycle in a graph that visits all the vertices and maximizes the 
sum of the weights (i.e., the reward) on the edges traversed. 
Also referred to as the max-TSP, the problem is NP-hard 
and so no known polynomial time algorithms exists to solve 
it. However, a number of approximation schemes have been 
developed. In [1] four simple approximation algorithms are 
analysed. The authors show that greedy, best-neighbour, and 
2-interchange heuristics all give a ^ approximation to the 
optimal tour. They also show that a matching heuristic, which 
first finds a perfect 2-matching and then converts that to a 
tour, gives a | approximation. In [2], the authors point out 
that Serdyukov's algorithm — an algorithm which computes 
a tour using a combination of a maximum cycle cover 
and a maximum matching — can give a | approximation. 
They also give a randomized algorithm that achieves a || 
approximation ratio. In this paper we look at extending the 
max-TSP problem to the case of submodular rewards. 

The main property of a submodular function is that of 
decreasing marginal value, i.e., choosing to add an element 
to a smaller set will result is a larger reward than adding it 
later. One application in which submodular functions appear 
is in making sensor measurements in an environment. For 
example, in [3] the authors consider the problem of placing 
static sensors over a region for optimal sensing. If a sensor is 
placed close to another, then the benefit gained by the second 
sensor will be less that if the first sensor had not already been 
placed. This can be represented quantitatively by using the 
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concept of mutual information of a set of sensors, which is 
a submodular function. Other areas where submodular func- 
tions come up include viral marketing, active learning [4] 
and AdWords assignment [5]. A different form of sensing 
involves using mobile sensors for persistent monitoring of a 
large environment using a mobile robot [6]. The metric used 
to determine the quality of the sensing is usually submodular 
in nature. Due to the persistent operation, it is desirable to 
have a closed walk or a tour over which the sensing robot 
travels. This motivates the problem of finding a tour that has 
the maximum reward. 

Various results exist for maximizing a monotone submod- 
ular function over an independence system constraint. This 
problem is known to be NP-hard, even though minimization 
of a submodular function can be achieved in polynomial time 
([7], [8]). Approximation bounds exist for optimizing over a 
uniform matroid [9], any single matroid [10], an intersection 
of p matroids and, more generally, p-systems [11] as well as 
for the class of fc-exchange systems [12]. Some bounds that 
include the dependence on curvature are evaluated in [13]. 

Contributions: The contributions of this paper are to 
present and analyze two simple algorithms for constructing 
a maximum tour on a graph. The metric used in maximizing 
the "reward" of a particular tour is a positive monotone 
submodular function of the edges. We frame this problem as 
an optimization over an independence system constraint. The 
first method is greedy and is shown to have a approx- 
imation. The second method creates a 2-matching and then 
turns it into a tour. This gives a max 1 3 ( 2 + B ) ; |(1 — K )| 
worst case approximation where k is the curvature of the 
submodular function. Both techniques require 0(|y| 3 ) value 
oracle calls to the submodular function. The algorithms are 
also extended to directed graphs. To obtain these results, we 
present a new bound for the greedy algorithm as a function 
of curvature. We also present some preliminary results for 
the case of a multi-objective optimization consisting of sub- 
modular (sensing) rewards on the edges along with modular 
(travel) costs. We incorporate these two objectives into a 
single function, but it is no longer monotone nor is it positive. 
We provide bounds on the performance of our algorithms 
in this case, but they depend on the relative weight of the 
rewards. 

Organization: The organization of this paper is as follows. 
In Section [II] we review some material on independence 
systems, submodularity, graphs and approximation methods 
for submodular functions. In Section [Till we formalize our 



problem. In Section IV we analyze a simple greedy strategy. 
In Section [V] we present and analyze a strategy to construct 
a solution using a matching. In Section |VI| we look at how 



the presented algorithms extend to the case where the graph 



is directed. Finally, in Section VII we discuss a method 
to incorporate costs into the optimization. Some simulation 
results are provided in Section |VIII| comparing the given 
strategies for various scenarios. 

II. Preliminaries 

Here we present preliminary concepts and give a brief 
summary of results on combinatorial optimization problems 
over independence systems. 

A. Independence systems 

Combinatorial optimization problems can often be formu- 
lated as the maximization or minimization over a set system 
(E, F) of a cost function / : T — > R, where E is the base 
set of all elements and T C 2 E . An independence system 
is a set system that is closed under subsets (i.e., if A G T 
then B C A => B G J 7 ). Sets in T are referred to 
as "independent sets". The set of maximal independent sets 
(i.e., all A G T such that A U {x} £ J", Vz G E \ A) are the 
bases. 

Definition II. 1 (p-system). Given an independence system 
S = (E, F). For any A C E let 

U(A) := max \B\ 

{B:B is a basis of A} 

L(A) := min \B\ 

{B:B is a basis of A} 

be the sizes of the maximum and minimum cardinality bases 
of A respectively. For S to be a p-system, 

U(A) < pL(A),VA C E. 

Definition II.2 (p-extendible system). An independence 
system {E,F) is p-extendible if given an independent set 
B G T, for every subset A of B and for every x ^ A such 
that A U {x} G T, there exists C C B \ A such that \C\ <p 
and for which (B \ C) U {x} G J 7 . 

Remark II. 3. A p-extendible system is a p-system. • 

Definition II.4 (Matroid). An independence system (E, T) 
is a matroid if it satisfies the additional property: 

. If X,Y G T and \X\ > \Y\, then 3a; G X\Y with 

YU{x}eT 

Remark 11.5. A matroid system is a 1-extendible system. • 
Remark II. 6. Any independence system can be represented 
as the intersection of a finite number of matroids [14]. • 
An example of a matroid is the partition matroid. The base 
set is the union of n disjoint sets, i.e. E = (JlLi where 
Ei l"l Ej = for i ^ j. Also given k G Z™. The matroid is 
J:={AC£: |An£?i| < fc. ( ,Vi = l...n}. 

B. Submodularity 

Without any additional structure on the set-function /, 
the optimization problem is generally intractable. However, a 
fairly general class of cost functions for which approximation 
algorithms exist is the class of submodular set functions. 



Definition II.7 (Submodularity). Let TV be a finite set. A 
function / : 2 N — >• R is submodular if 

f(S) + f(T) > f(SUT) + f(SDT), VS,TCN. 

Submodular functions satisfy the property of diminishing 
marginal returns. That is, the contribution of any element 
x to the total value of a set can only decrease as the set 
gets bigger. More formally, let A A (B) := f(AL)B) - f(A). 
Then, 

A A (x) > A s (x), VACBCN. 

Since the domain of / is 2 N , there are an exponential 
number of possible values for the set function. As a result, 
enumerating the value of every single subset of the base set 
is not an option. We will assume that f(S) for any S C N 
is determined by a black box function. This value oracle is 
assumed to run in polynomial time in the size of the input 
set. 

The class of submodular functions is fairly broad and 
includes linear functions. One way to measure the degree 
of submodularity is the curvature. A submodular function is 
said to have a curvature of n G [0,1] if for any A C N and 

ee N\A 



A A (e) > (1 - «)/(e). 



(1) 



In other words, the minimum possible marginal benefit of 
any element e is within a factor of (1 — n) of its maximum 
possible benefit. 

We formulate a slightly stronger notion of the curvature 
- the independence system curvature, kj - by taking the 
independence system in to account. In this case, ([T| need 
only be satisfied for any A £ J 7 and e G N\A, A\J {e} G J '. 
This value of curvature will be lower than the one obtained 
by the standard definition given above. 

C. Greedy Algorithms 

The greedy algorithm is a simple and well-known method 
for finding solutions to optimization problems. The basic idea 
is to choose the "optimal" element at each step. So given a 
base set of elements E, the solution S is constructed as: 

(i) Pick the best element e from E. 

(ii) If S U {e} is a feasible set, then S = S U {e}. 

(iii) E = E\{e}. 

(iv) Repeat until S is maximal or E is empty. 

A general greedy algorithm for maximizing a submodular 
function over an independence system is given in Algo- 
rithm [T] Since the objective function is submodular, the 
marginal value of each element in the base set changes in 
every iteration and so has to be recalculated. This causes the 
runtime to be 0(|iV| 2 /) where / is the runtime of the value 
oracle. 

Based on the properties of submodularity, a more efficient 
implementation can be constructed. The specific property 
that helps here is that of decreasing marginal benefit. Given 
A C B and two elements ei, e-i ^ B. If A A (ei) < A^^), 
then we can conclude that As(ei) < A#(e2). Therefore, 
As(ei) does not need to be calculated. 



Algorithm 1: generalGreedy((£', T), /) 

Input: An independence system M = (E,!F). A 

function / : T — > R. 
Output: Basis of X. 

1 N<- E; 

2 while N 7^ and S not maximal in T do 

3 
4 



foreach e e N do 

|_ calculate A(e) := f(SU{e}) - f(S); 

s m :— argmaXg A(e); 

6 if S U {to} e J" then S <- S U {to}; 

7 N <~ N\ {to}; 

8 return S; 



This idea was first proposed in [15]. Algorithm [2] is a mod- 
ified version of the accelerated greedy algorithm presented in 
[4] and takes into account the independence constraint. The 
main modification from [4] is the check for independence in 
line E] 

Algorithm 2: generalGreedy((£', J 7 ), /) - Accelerated 

Input: An independence system (E,F). A function 

/ : T -> R. 
Output: Basis of T . 

1 Priority Queue Q <— 0; 

2 foreach e £ E do Q.insert(e,oo); 

3 while Q 7^ and S nof maximal in T do 



5 
6 
7 
8 
9 
10 
11 

12 
13 
14 



NULL; S n 



< oo; 



< Q.maxPriorityO do 



while Q ^ and 5„ 

e <- Q.popO; 
if T U {e} e J 7 then 

A(e)<-/(TU{e})-/CT); 
Q.insert(e,A(e)); 
if £ ma;E < A(e) then 



if Q ^ then 



5* S U {e ma;c }; 
Q.remove(e ma:c ); 



is return S; 



Unfortunately, the accelerated greedy algorithm has the 
same worst case bound as the naive version. However, 
empirical results have shown that it can achieve significant 
speedup factors [15], [4]. 

D. Approximations 

Here we consider some useful results for optimization over 
independence systems. First, lets look at the case of a linear 
objective function. In the special case of a matroid (p = 1), 
the greedy algorithm gives an optimal solution [14]. 

Lemma II.8. For the problem of finding the basis of a p- 
system that maximizes some linear non-negative function, a 



greedy algorithm gives a i approximation. 

In [9] the authors look at maximizing a submodular 
function over a uniform matroid (selecting k elements from 
a set). They show that the greedy algorithm gives a worst 
case approximation of 1 — - . This is actually the best factor 
that can be achieved as in [16] it is shown that to obtain a 
(1 — A + ^-approximation for any e > is NP-hard for the 
maximum fc-cover problem, which is the special case of a 
uniform matroid constraint. 

In [11], the optimization problem is generalized to an 
independence system represented as the intersection of P 
matroids. The authors state that the result can be extended 
to p-systems. A complete proof for this generalization is 
given in [10]. For a single matroid constraint, an algorithm 
to obtain a (1 — 1/e) approximation is also given in [10]. 

Lemma II.9. For the problem of finding the basis of a p- 
system that maximizes some monotone non-negative submod- 
ular function, a greedy algorithm gives a approximation. 

We know that linear functions (i.e. curvature is 0) are 
a special case of submodular functions so it is reasonable 
to expect the greedy bound to be a continuous function of 
the curvature. In [13], bounds that include the curvature are 
presented for single matroid systems, uniform matroids and 
independence systems. For a system that is the intersection 
of p matroids the greedy bound is shown to be j n me 
following we extend this result to p-systems. 

Theorem 11.10. Consider the problem of maximizing a 
monotone submodular function f with curvature k, over a 
p-system. Then, the greedy algorithm gives an approximation 
factor of —A—. 

J J p+K 

Proof. The proof is inspired from the proof where the system 
is the intersection of p integral polymatroids [13, Theo- 
rem 6.1]. Let W be the optimal set. Let S = {sx, . . . , Sk} 
be the result of the greedy algorithm where the elements 
are enumerated in the order in which they were chosen. For 



t = 1, 
f{S t ) - 



,k, let S t 



u 



and p t := p St - 1 (s t ) 



/(S t _i). Therefore, f(S) = £t=i/>f So, 



f(W US)> f(W) + J2 P(wus)\e(e) 
>/(W)+^(l-«)/(e) 



>f(W) + (l-K)Y,PU 

t=l 



(2) 



where the first and third inequalities hold due to submodu- 
larity and the second is by the definition of curvature. Also, 



f(WUS)<f(S) 



sew\s 



(3) 



Following from the analysis of the greedy algorithm for p- 
systems in [10, Appendix B], a fc-partition W\, . . . , W% of 
W, can be constructed such that \Wi\ < p and p t > ps t _! (e) 



for all e £ W t . Therefore, 

k 

Y Ps(e) = ^ Y Ps( e ) 

e£W\S i=l e£W t \S 

k k 

i=l i=l 

since ps(e) < ps t -ii e ) f° r a ^ * submodularity. 

Combining ([3]) and Q with Q, the desired result can be 
derived as follows, 

/(W) + (l-/s)/(5)</(5)+p/(5) 
=>/(W)<(p + «)/(S). 

□ 

£. Sef Systems on Graphs 

In this section we introduce some graph constructs and 
then give some results relating them to p-systems. 

We are given a graph G = (V, E, c, w) where V is the set 
of vertices and E is the set of edges. Each of the edges has 
a cost given by the function c : E — » R>o- A set of edges 
has a cost of c(S) = 2~2 q& s c (l)- A set of edges also has a 
reward or utility associated with it given by the submodular 
function 

A watt is a sequence of vertices (v\, . . . ,v n ) such that 
:= {vi,Vi+i} E E. A path is a walk such that e< ^ 
ej,Vi,j. A simple path is a path such that t>j =^ Vj,Vi,j — 
1 . . . n. A simple cycle is a path such that v.; ^ Vj,Vi,j = 
1 . . . n — 1, and i>„ = v\. A Hamiltonian cycle is a simple 
cycle that visits all the vertices of G. We will refer to a 
Hamiltonian cycle as a tour, and a simple cycle that is not 
a tour as a subtour. Let H C 2 E be the set of all tours in 
G. 

7 J b-matching: Given an undirected graph G = (V,E). 
Let S(v) denote the set of edges that are incident to v. 

Definition 11.11 (fe-matching). Given edge capacities u : 
E[G] -> N U 00 and vertex capacities b : V[G] -> N. A b- 
matching is an assignment to the edges / : E[G] —> Z + such 
that /(e) < u(e),Ve € E[G] and E ee s(v) /(e) < &(«),V« € 
V[G]. 

^ Eegifo) /( e ) — b(v)yv, the b-matching is perfect. 

A simple b-matching is the special case that the edge 

capacity u(e) = 1 for each e G E, 

For the rest of this paper, any reference to a b-matching 
will always refer to a simple b-matching. 

Theorem 11.12 (Mestre, [17]). A simple b-matching is a 2- 
extendible system. 

2) The Traveling Salesman Problem: Given a complete 
graph, the classical Traveling Salesman Problem (TSP) is to 
find a minimum cost tour. The TSP can be divided into two 
variants: the Asymmetric TSP and the Symmetric TSP. In 
the ATSP, for two vertices u and v, the cost of edge (u, v) is 
different from the cost of (v,u), which amounts to the graph 
being directed. In the STSP, c(u,v) = c(v,u), which is the 
case if the graph in undirected. 



In order to formulate the TSP, the set of possible solutions 
can be defined using an independence system. The base set of 
the system is the set of edges in the complete graph. For the 
ATSP, a set of edges is independent if they form a collection 
of vertex disjoint paths, or a complete Hamiltonian cycle. 

Theorem 11.13 (Mestre, [17]). The ATSP independence 
system is 3-extendible. 

The ATSP can be formulated as the intersection of 3 
matroids. These are: 

(i) Partition matroid: Edge sets such that the in-degree of 
each vertex < 1 

(ii) Partition matroid: Edge sets such that the out-degree 
of each vertex < 1 

(iii) The 1 -graphic matroid: the set of edges that form a 
forest with at most one simple cycle. 

The STSP is just a special case of the ATSP. Therefore, the 
results from the ATSP carry over to the STSP. Formulating 
it as an ATSP, however, requires doubling the edges in 
an undirected graph. Instead, we can directly define an 
independence system for the STSP. 

A set of edges is independent (i.e. belongs to the collection 
F) if the induced graph is composed of a collection of vertex 
disjont simple paths or a Hamiltonian cycle. This can also 
be characterized as the following two conditions: 

(i) Each vertex has degree at most 2 

(ii) There are no subtours 

Theorem 11.14. The undirected TSP independence system is 
3-extendible 

Proof. To show this, we can consider all the cases to show 
that the system satisfies the definition of a 3-extendible 
system. Specifically, assume some given set A C B G T and 
determine the number of edges that will need to be removed 
from B \ A so that adding any x = {it, v} ^ B (such that 
A U x £ F) to B will maintain independence. 

Adding an edge can violate the degree constraint on at 
most two vertices (specifically u and v) and/or the subtour 
constraint. To satisfy the degree constraint, at most one edge 
will need to be removed from B for each vertex. To satisfy 
the subtour requirement, at most one edge will need to 
be removed from the subtour in order to break the cycle. 
Therefore, up to three edges will have to be removed in 
total which means that the system is 3-extendible. 

One case where exactly three edges will have to be 
removed comes about if A contains an edge, e\, incident 
to u and another, e%, to v. If adding x to B violates both 
conditions of independence then we know there exists a path 
P = u ~-* v C B. Assume that both ei,e2 €E P. Then 
one edge will have to be removed from P to break the 
cycle (produced by adding x) and two more will need to be 
removed to satisfy the degree requirements at u and v. □ 

Since the STSP system is 3-extendible, it is also a 3- 
system. A better result is given in the following lemma. 

Lemma 11.15 (Jenkyns, [18]). On a graph with n vertices 
the undirected TSP is a p-system with p — 2— L^lPJ < ^- 



III. Problem Formulation 

Given a complete graph G — (V, E, w), where w is 
a submodular rewards function that has a curvature of k, 
we are interested in analysing simple algorithms to find a 
Hamiltonian tour that has the maximum reward The specific 
situation we look at is 



sen 



(5) 



In Section VII we will briefly discuss the problem of where 
costs are incorporated into the optimization problem. 

In the following sections, we look at two methods of 
approximately finding the optimal tour according to ([5]). 

IV. A Simple Greedy Strategy 

A greedy algorithm to construct the TSP is given in 
Algorithm [3] The idea is to pick the edge that will give 
the largest marginal benefit at each iteration. The selected 
edge cannot cause the degree of any vertex to be more than 
2 nor create any subtours. If it fails either criteria, the edge 
is discarded from future consideration. 

Theorem IV. 1. The complexity of the greedy tour algorithm 
(Alg. [?]) is 0(\V\ 3 (f + log |V|)), where f is the runtime of 
the oracle, and is a approximation. 



Proof. By Lemma II.15| and Theorem 11.10 Algorithm [3] is 



2 + K 



-approximation of d5]l. 



The calculation and selection of the element of maximum 
marginal benefit (line [3]l requires calculating the marginal 
benefit for each edge in E \ M. Note that recalculation of 
the marginal benefits need only be done when the set M 
is changed. Since only one edge is added to the tour M at 



each update in line 12 recalculation only needs to take place 
a total of |V| times. In addition, the edges will need to be 
sorted every time a recalculation is performed so that future 
calls to find the maximum benefit element can be done in 
constant time (if no recalculation is needed). Therefore, the 
recalculation and sorting take 0(|V|(|J5|/ + \E\ log \E\)). 
The runtime for DFS is 0(\V\ + \E\), but the DFS in run 
using only the edges picked so far, so its runtime becomes 
0(|V|). Assuming all the other commands take 0(1) time, 
the total runtime is 0{\V\(\E\f + \E\ log \E\) + \E\(\V\ + 
1)). For a complete graph, \E\ — 0(\V\ 2 ) and therefore the 
runtime becomes 0(\V\ 3 f + 2\V\ 3 log \V\ + \V\ 3 ). □ 

Remark IV.2. A more efficient data structure would be to 
use disjoint-sets for the vertices. Each set will represent a 
set of vertices that are in the same subtour. This gives a total 
runtime of |l/| 2 log|y| following the analysis in [19, Ch. 
21,23]. Unfortunately, the "recalculation" part still dominates 
the total runtime and so the resulting bound is the same. • 
Motivated by the reliance of the bound on the curvature, 
in the next section we will consider a method to obtain 
improved bounds for functions with a lower curvature. 



Algorithm 3: Greedy algorithm for TSP 
Input: Graph G = (V,E). Function oracle 

w : 2 E -> R>„ 
Output: Edge set M corresponding to a tour. 

1 M <- 0; 

2 vDeg «- lx \ v \; 

3 reCalc 4— true; 

4 while E ^ and \M\ < \V\ do 



9 

10 

11 

12 
13 

14 



■s— argmax„ 



\EPe 



{u,v} <- V[e m }; 

II Check if edge is valid 
addEdge 4- (vDeg[u\ < 2 and vDeg[v] < 2); 
if addEdge and \M\ < \V\ - 1 then 

// Check for potential subtour 
Run DFS on Gt — (V, M) starting at vertex u; 
if vertex v is hit then addEdge <— false; 

if addEdge then 
S <- M U {e m }; 

Increment vDeg[u] and vDeg[v]; 
E^E\{e m }; 



is return M; 



V. 2-Matching based tour 

Another approach to finding the optimal basis of an 
undirected TSP set system is to first relax the "no subtours" 
condition. The set system defined by the independence 
condition that each vertex can have a degree at most 2 is in 
fact just a simple 2-matching. As before, finding the optimal 
2-matching for a submodular function is a NP-hard problem. 
We discuss two methods to approximate a solution. The first 
is a greedy approach and the second is by using a linear 
relaxation of the submodular function. We will see that the 
bounds with linear relaxation will be better than the greedy 
approach for certain values of curvature. 

A. Greedy 2-Matching 

One way to approximate the solution to the problem of 
maximizing a submodular function to finding a maximum 
2-matching is to use a greedy approach. 

Theorem V.l. The complexity of the greedy matching algo- 
rithm Q is 0{\V\ 3 {f + log |V|)), where f is the runtime of 
the oracle. The greedy approach is a -^^-approximation. 

Proof. Similar to the greedy tour, picking the edge of 
maximum benefit requires requires recalculation of all the 
marginal benefits and only need to be done \V\ times. So 
picking the best edge requires 0(|V|(|i?|/ + \E\ log \E\)) 
time. All other parts require 8(1) time per iteration (for 
\E\ iterations). Therefore, the total runtime is 0(|V^| 3 / + 
2\V\ 3 \og\V\). 

Since a simple 2-matching is a 2-extendible system, the 
greedy solution will be within ^tp^ of the optimal by 
Theorem [ELK)] " □ 



Algorithm 4: Greedy algorithm for Maximal Matching 
Input: Graph G = (V, E). Function oracle 

/ : 2 E -> M>o 
Output: A simple 2-matching M 

1 M <- 0; 

2 vDeg <- lx \ v \; 

3 reC'alc <s— true; 

4 while E ^ do 



argmax„ 



\EPe 



{u,v} <r- V[e m }; 

II Check if edge is valid 
if vDeg[u] < 2 and vDeg[v] < 2 then 
S 4- M U {e m }; 

Increment vDeg[u] and vl}e<?[i>]; 
if) £'^£;\{e m }; 
li return M; 



B. Maximum 2-Matching Linear Relaxation 

For a linear objective function, the problem of finding a 
maximum weight 2-matching can be formulated as a binary 
integer program. Let x — {xij} where 1 < i < j < \V\ 
and let each edge be assigned a real positive weight given 
by Wij. Define E(x) as the set of edges for which scy = 1. 
Then the maximum weight 2-matching, (V,E(x)), can be 
obtained by solving 

\v\-i 

max uijj x^ 



S.t. Xij 



i—l j>i 



7 j x 3i 



2, ViG{l, 

^■€{0,1}, l<i<j<\V\. 



\V\} 



This method, however, does not have any good bounds on 
runtime. Alternatively, for a weighted graph the maximum 
weight 2-matching can be found in 0(n 3 ) time [14] via 
an extension of Edmonds' Maximum Weighted Matching 
algorithm. 

For our original problem with as the objective function 
for the maximization, the two methods described here can 
obviously not be applied directly. Therefore, we define a 
linear relaxation w of the submodular function w as follows, 



w 



(S) = 5>(e) = £A (e), V5 C E. 



(6) 



e<£S 



In other words, we give each edge a value that is the 
maximum possible marginal benefit that it can have. Using 
this relaxation, the optimal 2-matching based on the weights 
w can be calculated. 

Theorem V.2. Let Mi be the maximum 2-matching for a 
submodular rewards function w and let M 2 be the maximum 
weight 2-matching using w as the edge weights. If w has an 
independence system curvature of Ki, then 

w{M 2 ) > (1- k/)«)(Mi). 



Proof. The definition of curvature states that Ag(e) > (1 — 
Ki)w(e) for any independent subset S of the edges and e 
such that S U {e} is also independent. Using this and the 
definition of submodularity, 

w(M 2 ) = Afl(ei) + A {ei} (e 2 ) + A {ei!62} (e 3 ) + . . . 

eGM 2 

= (1 - ki)w(M 2 ). 

Since w(M 2 ) is maximum, wiM-^) > w(Mi). Therefore, 

w(M 2 ) > (1 - k/)w(Mx) > (1 - ki)w(Mi), 

due to decreasing marginal benefits. Note that this bound 
also holds using the standard definition of curvature. □ 

C. Reduced 2-Matching 

The output of either of the two algorithms described will 
be a basis of the 2-matching system. Once a maximal 2- 
matching has been obtained, it needs to be converted into 
a tour. The edge set corresponding to the 2-matching can 
be divided into a collection of disjoint sets. These sets will 
either be subtours or simple paths. Any simple path can be at 
most one edge in length since otherwise its endpoints could 
be joined together (as the graph is complete) contradicting 
the maximality of the matching. In addition, only one of 
the disjoint sets will correspond to a simple path and it 
will contain either one edge or a single vertex. Therefore, 
a maximal 2-matching will consist of a collection of vertex 
disjoint subtours and at most one extra edge. 

In order to convert the maximal 2-matching to a tour, the 
subtours will have to be broken by removing an edge from 
each one. The remaining set of simple paths will then need 
to be connected up. 

We first give a result on efficiently finding a subset to 
remove from a set while maintaining | of the value. We 
then give an algorithm to reduce a set of subtours starting 
from a maximal 2-matching. 

1) Removing elements from a set: Given a set S and a re- 
partition of the set {A 1 }™ v i.e. S = U™ i A i and that A 1 n 
Ai = for all i ^ j. Each part contains m elements, A % = 
Uj=i a j' sucn tnat 3 < rii < N. Let k — mi^ n i an d let 
n = . . . , n m ). Also given a monotone non-decreasing 
submodular function / : 2 s -> M> . Let := A 1 \ aj. 

Theorem V.3. Given set S and disjoint subsets A 1 ,i — 
1, . . . , m, where k is the size of the smallest subset A 1 , 
as defined above. There exists a set R of m elements ( to 
which each set A 1 contributes exactly one element) such that 

f(R)>(i-Df(s)>y(s). 

Proof. A selection of one element from each set can be given 
by the vector p e Z™,p < n. Let b p = /(S)-/(U=i At pi ) 
be the unique contribution of the selected elements to the 
total reward of S. For convenience of notation, define i := 

The following lemma will help show the desired result. 
Lemma V.4. J^Li b i < f( s )- 



Proof. The basic argument is that bj is the minimum possible 
contribution of the set (J . a\ . So the total contribution over 
different i will be less than (or equal to) the sum of their 
minimum contributions. 

Let Bi = Uj=i a i be the set obtained by selecting the ith 
element from each set. 

Using the definition of marginal benefit As(X), we can 
write f(T) = At(B 1 )+A Bl (B 2 ) + ... + A T \ Bk (B k ). 

However, by submodularity, bi = As\g i (.Bj) < 
A x (B,),yX C S \ Bi. Therefore, f(T) > 

Eli &s\ Bi (Bi). 

Combining this with the fact that T C S => f(T) < 
f(S) by monotonicity, we get the desired result. □ 

To prove the theorem statement, assume that there does 
not exist any set with the desired property, i.e. Vp G 

Z%,p < n we have f(\Jti A - Pi ) < i 1 - l)f( S Y From 
this assumption, we can see that b v > 4/(5). Therefore, 



V.4 



the desired result is 



Ei=i & i > f( s )- With Lemma 
obtained by contradiction. 

For the second part of the inequality, since k = min rii > 
3, so (1 -£)>§. " □ 

Corollary V.5. Given a set S such that \S\ > mk, then 
there exists a subset T of m elements such that f(S \ T) > 
(1 - l)f(S). 

Proof. Since \S\ > mk, S can be divided into at least m 
subsets of size > k each. Creating such a division means 
that S can be represented as U2a A% such that \ A l \ > k,Vi. 
Therefore, the result of Theorem |V.3| applies. □ 



2) Algorithm to remove one element per set: Based on the 
above results for existence of a set that can be removed while 
maintaining at least | of the original value, we introduce a 
simple technique (given in Algorithm [5]l to find such a set 
by searching over a finite number of disjoint sets. 



Algorithm 5: ReduceSet(S, A 1 ,..., A m ) 

Input: S = U™ i A* where A i = U"=i oj 
Output: U C S s.t. [/ni'=l for all % = 1 . 
l i i— 1; k <— mixij \ A^\; 

.- _ 

k 



3 while f(S\U) < *jrf(S) do 



i «— i + 1; 

u := ur=i 



6 return [/; 



Theorem V.6. Algorithm [3] is correct. 

To prove correctness, it will help to establish the following 
result. 

Lemma V.7. Given a set S composed of m disjoint subsets 
as defined above. Then there exists i 6 {1, . . . , k} such that 

f([JT=i A -i)>a-l)f(s). 

Proof. The proof follows a similar logic as the proof of 
Theorem IV. 3 1 



,*}, /(u;=i^)<^/(s). 

-/(U7=i^)>s/(s)- Which 



Assume that V« G {1, .. 
So Vie {1,...,*}, f(S) 

implies Eti/(5) - /(U™i^) > 1/(5) = f(S). 
But we know that f(S) > ZLif(S) ~ f(UT=l A -i) 
(by Lemma |V.4[ ). Therefore, the desired result is shown by 
contradiction. □ 



Proof of Theorem V.6 The algorithm (randomly) selects one 
element to remove from each set. If the removal means more 
than 4 of the reward is lost, another set of elements is chosen 
that is disjoint from the previously chosen one. 

Since each set A 1 contains atleast k elements, there 
are k possible disjoint sets to choose from (for removal). 



Lemma V.7 states that one of these is guaranteed to have the 
desired property of lowering the objective by at most 4. □ 

Theorem V.8. The complexity of Algorithm^is O(kf) — 
0(\S\f). 

3) Algorithm to delete edges: We can now use Algo- 
rithm|5]to remove one edge from each subtour in a matching. 

The subtours, T\ can be considered as a collection of 
disjoint edge sets. Each subtour will consist of atleast three 
edges. Since we want to remove one element from each set 
while trying to maximize a submodular reward function, the 



results of Theorem V.3 apply and so we know that there 



will exist a solution such that at most | of the value of the 
objective is lost. 

If there exists an extra edge not part of any subtour, it does 
not affect the result since, following from Corollary |V.5| it 
can just be considered as part of one of the other subtours. 
Since any k of the elements of any subtour are needed for 
the algorithm, these k can be chosen to not include the extra 
edge. Outlined in Algorithm [6] is an method that will find 
such a set of edges to remove. 

Algorithm 6: Reduce Matching 
Input: A 2-matching Gm = (V, M) where 

AI = 1J™ 1 T l and the sets T % are the subtours. 
Output: R C M such that each subtour has one edge 
removed. 

1 n <- 0; 

2 for z = 1 ... m do 

3 if \T l \ > 1 then 

4 A n = T l ; 

5 Increment n; 

6 return reduceSet(M, A 1 ,..., A n ); 



Theorem V.9. Algorithm [6] correctly reduces the matching 
while maintaining | of the original value. The complexity of 
the algorithm is 0{kf) = 0(\V\f) 



D. Tour using matching algorithm 

We now present an outline of the complete 2-matching 
tour algorithm. The steps are illustrated in Figure [T] 



a) Initial 




(b) 2-Matching 



(c) Reduced 



(d) Tour 






Fig. 1. The steps in the 2-matching based tour algorithm 

(i) Run Algorithm [4] to get a simple 2-matching, Mi. 
Using the linear relaxation w of w, solve for the 
maximum weight 2-matching, A/ 2 . From Mi and M 2 , 
choose the 2-matching that has a higher reward. 

(ii) Find all sets of sub tours, (using, for example, DFS). 

(iii) Run Algorithm [6] to select edges to remove. 

(iv) Connect up the reduced subtours into a tour. One 
method to accomplish this would be by finding all the 
vertices that have degree of or 1 and then arbitrarily 
connecting them up to complete the tour. 

Theorem V.10. The 2-matching tour algorithm gives a 
max | 3 (2+k) ' f(l — n)^ -approximation in 0(f(n 3 + n) + 
n 3 logn) time. 

Proof. The matching is a max < , 



1 



k | -approximation 



from Theorems V.l and V.2 Given that a tour is a special 



case of the maximum simple 2-matching, the maximum tour 
has a value less than or equal to the optimal 2-matching. 
Removing edges from the matching produces a subgraph 
that is atleast I of the original value of the 2-matching 



(Theorem V.9 1. Adding more edges to complete the tour will 
only increase the value. Therefore, the resulting tour is within 
max | ' f(l — K )} °f th fi optimal tour. Note that the 

bound given is relative to the optimal 2-matching and not to 
the optimal tour so the actual bound may be better. 

The first step involves building the matching. The greedy 
matching takes 0(|V| 3 (/ + log |V|)) time and the linear 
relaxation approximation takes 0(|U| 3 ) time. Finding the 
edges in all the components is 0(|y| + = 0(|V|) since 
the number of edges in a matching is at most |V|. Removing 
the edges is 0(|y|/). Assuming it is easy to find the free 
vertices, connecting up the final graph is 0(m) = 0(\V\) 
where m is the number of subtours and ranges from 1 to 
^■j. Therefore, the total runtime is 0(n 3 (f + logn) + 
n(2 + /)) = 0(/(n 3 + ?i)+n 3 logn). □ 

The reason for calculating the 2-matching twice is now 
explained. The greedy method gives a \ -approximation in 
the case of a linear objective (Lemma [ll.8[ ). A similar method 
of using a matching is used in [1] for a linear reward 
function. In that case, the approximation ratio achieved is 
| of the optimal tour. The reason for this is that for a linear 
function, an optimal perfect matching is obtained and the 
bound depends only on how much is lost in removing the 
edges. We showed that a similar bound limiting the loss can 
be obtained for the submodular case; however, just using 
the greedy approximation for the 2-matching, we cannot 
guarantee as good a result since the final tour would be 
within |i = | of the optimal. By also using the second 



method of finding the 2-matching, our resulting bound for 
the final tour in the case of a linear function improves to |. 
Remark V.ll. For any value of k < |(V3 - 1) « 0.366, 
using the linear relaxation method to construct a 2-matching 
and then converting it into a tour, gives a better bound with 
respect to the optimal tour than by using the greedy tour 
approach. • 
Remark V.12. In our case, the | loss is actually a worst case 
bound where all the subtours are composed of 3 edges. For 
a graph with a large number of vertices, the subtours will 
probably be larger and so k may be larger than 3. This would 
lead to a better bound for the algorithm. • 
Remark V.l 3. In removing the edges we used an algorithm 
to quickly find a "good" set of edges to remove but made 
no effort to look for the "best" set. Using a more intelligent 
heuristic we would get better results (of course at the cost 
of a longer runtime). 

Also, the last step where the reduced subtours are con- 
nected into a tour can be achieved using various different 
techniques. As mentioned above, one possibility is to just ar- 
bitrarily connect up the components. Another method would 
be to use a greedy approach (so running Algorithm [3] except 
with an initial state). This would not change the worst case 



runtime given in Theorem V.10 Alternatively, if the number 



of subtours is found to be small, an exhaustive search going 
through all the possibilities could be performed. • 

VI. Extension to directed graphs 

The algorithms described can also be applied to directed 
graphs yielding approximations for the ATSR 

A. Greedy Tour 

For the greedy tour algorithm, a slight modification needs 
to be made to check that the in-degree and out-degree of the 
vertices are less than 1 instead of checking for the degree 
being less than 2. Since the ATSP is a 3-extendible system, 
the approximation of the greedy algorithm changes to 
instead of as in the undirected case. 

B. Tour using Matching 

Instead of working with a 2-matching, the system can be 
modelled as the intersection of two partition matroids: 

• Edge sets such that the indegree of each vertex < 1. 

• Edge sets such that the outdegree of each vertex < 1. 
This system is still 2-extendible and so the approximation 
for the greedy 2-matching does not change. For the second 
approximation, the Maximum Assignment Problem (max 
AP) can be solved by representing the weights in |6} as 
a weight matrix W where we set Wu — —00. The Hun- 
garian algorithm, that has a complexity of 0(n 3 ), can be 
applied to obtain an optimal solution. Therefore, the result 



of Theorem V.2 still applies 



The result of the greedy algorithm or the solution to the 
assignment problem will be a set of edges that together form 
a set of cycles, with the possibility of a lone vertex. Note 
that a "cycle" could potentially consist of just two vertices. 
Therefore, removing one edge from each cycle will result 



in a loss of at most ~ instead of ~. This follows directly 
from the analysis in Theorem V.3 but using k = 2 instead 
of k = 3. The final bound for the algorithm is therefore 

maX {2(2Wl( 1 - 

VII. Incorporating Costs 

Often times, optimization algorithms have to deal with 
multiple objectives. In our case, we can consider the tradeoff 
between the reward of a set and its associated cost. A number 
of algorithms presented in literature look at attempting to 
maximize the benefit given a "budget" or a bound on the 
cost, i.e. find a tour T such that 

T £ argmax u>(S) s.t. c(S) < k. 
sen 

This involves maximizing a monotone non-decreasing sub- 
modular function over a knapsack constraint as well as an 
independence system constraint. 

We will work with a different form of the objective 
function defined by a weighted combination of the reward 
and cost. For a given value of (3 £ [0,1], solve 



T £ argmax /(S) 
sen 

f(S,0) = (l-P)w{S)-pc(S), 



(7) 
(8) 



where the combined objective is a non-monotone possibly 
negative submodular function. An advantage of having this 
form for the objective is that the "cost trade-off is being 
incorporated directly into the value being optimized. Since 
the cost function is modular, maximizing the negative of the 
cost is equivalent to minimizing the cost. So the combined 
objective tries to maximise the reward and minimize the cost 
at the same time. 

The parameter j3 is used as a weighting mechanism. The 
case of ft = corresponds to ignoring costs and that of 
P = 1 corresponds to ignoring rewards and just minimizing 
the cost (this would just be the traditional TSP). 

One minor issue with the proposed function is that the 
rewards and costs may have different relative magnitudes. 
This might mean that /3 is biased to either or 1 . Normalizing 
the values of w and c will help bring both the values down 
to a similar scale. This gives the advantage of being able to 
use f3 as an unbiased tuning parameter. 

Therefore, the final definition of the objective function is 



f(S) 



1-/3 



w(S) 



A 

M r 



c(S) 



(9) 



where M c = maxs c(S) and M w = maxg w(S). The exact 
values of M c and M w may be hard to calculate and so could 
be approximated. 

To address the issue of the function being non-monotone 
and negative, consider the alternative modified cost function 



This gives the following form for the objective function, 

f'(S,l3) = (l-p)w(S) + Pc'(S) (10) 
= (l-P)w(S) + P(\S\M - c(S)), 
= f(S,P) + p\S\M, 

which is a monotone non-decreasing non-negative submod- 
ular function. This has the advantage of offering known 
approximation bounds. The costs have in a sense been "in- 
verted" and so maximizing d still corresponds to minimizing 
the cost c. 

Remark VII. 1. Instead of using \S\M as the offset, we could 
use the sum of the IS*! largest costs in E. This would not 
improve the worst case bound but may help to improve 
results in practice. • 

Lemma VII.2. For any two sets S\ and S 2 , if f'(Si) > 
af(S 2 ) then /(S x ) > af(S 2 ) + M(a\S 2 \ ~ \s\\) 



Proof. 



f'(Si) > af(S 2 ) 

/(Si) + |Si|M>a(/(S 2 ) + |S 2 |M) 
/(Si)>a/(52)+M(a|S 2 |-|5i|) 



□ 



Remark VII. 3. As a special case, if |Si| = \S 2 \, then 
/(Si) > /(S 2 ) if and only if /'(Si) > /'(S 2 ). Therefore, it 
can be deduced that over all sets of the same size, the one 
that maximizes §8^ is the same one that maximizes (JTOj. • 

Theorem VII.4. Let G\ be the greedy solution obtained 
maximizing (|8j and G 2 be the greedy solution maximizing 
Then Gi = G 2 . 

Proof. At each iteration i of the greedy algorithm, we are 
finding the element that will give the maximum value for a 
set of size i + 1. Since comparison is being done between 
sets of the same size, the same element will be chosen at 
each iteration. □ 

Theorem VII.5. Consider a function f = fx — ,f 2 , where 
fi is submodular and f 2 is modular, and a p-system (E, J 7 ). 
An a- approximation to the problem maxjg^ /'(S), where 
/'(S) = /(S) + \S\M 1 M = max e£ £ f 2 (e), corresponds to 
an approximation of aOPT — (l + a — 2^)M?i, for the 
problem maxsgjF /(S), where n is the size of the maximum 
cardinality basis. 

Proof. Let S be the solution obtained by a a-approximation 
algorithm to /'. Let T be the optimal solution using /'. 
Let Z be the optimal solution using /. Note the following 
inequality: 



\A\ < \B\p 



a\B\-\A\>\B\(a-p)>\A\(--l) 



P 



c'(S) = \S\M-c(S), 



M = maxc(e). 

eeE 



By using the property of p-systems that for any two bases 
A and B, 

\B\ 



and by Lemma |VII.2| we have 

f(S) > af(T) => f(S) > af(T) + M (a\T\ - \S\) 
So, 

f(S)>af(T)-M\T\(p-a) 
> af(T) - M\S\ (l - 



> af{T) - Mn 1 - 



Also, 

f'(T) > f(Z) 
Substituting, 



f(T) > f(Z) - M\T\ 1 - 



f{S)>af{Z)-aM\T\[\ 



Mn 1 



> af(Z) - Mn ( 1 + -a 



□ 

Remark VII. 6. In the special case of a 1 -system (this includes 
matroids), or more generally any problem where the output 
to the algorithm will always be the same size, we have l^l = 
|T| = \Z\ and also T = Z following from Remark |vTl7 



This means that an algorithm that gives a relative error of 
a when using /' as the objective will give a normalized 
relative error of a when using / as the objective (i.e. f(S) > 
aOPT + (1 - a)WORST). • 

Corollary VII.7. The problem maxsg^ f(S) can b e a P~ 
proximated to (p + I)- 1 OPT + (1 + e)WORST (where 



€ G 



J, 1) is a function of p) using a greedy algorithm. 



Proof Let f(S) = f(S) + \S\M,M = max eeB / 2 (e) and 
run the greedy algorithm to obtain the set Tq. Let Let T and 
Z be defined as above. Since /' is a non-negative monotone 
function, 

f\T G ) > — \—f'{T). 



P 



r 



So applying Theorem VII. 5 
1 



f(T G 



> 



P 



1 



f(Z) - Mn 1 + 



pip + 1) 



A. Performance Bounds with Costs 

Using the proposed modification, new bounds can be 
derived for the algorithms discussed in this paper. One thing 
to note is that in the case of the tour, all tours will be the same 
length that is \V\, even though the tour is not a 1 -system. 



Therefore, we can apply Lemma VII. 2 directly. 



Theorem VII.8 (Greedy tour with edge costs). Using ^ as 
the objective, Algorithm^outputs a tour that has a value at 
least | OPT — ^(3M\V\ where M is the maximum cost of 
any edge. 





(a) Example graph. (b) Greedy solution. 

Fig. 2. A ten vertex graph and example solution. 



Theorem VII.9 (Tour via 2-matching with edge costs). 
Using ([8} as the objective, the tour based on a matching 
algorithm outputs a tour that has a value at least ^OPT — 
t(3M\V\ where M is the maximum cost of any edge. 

The bounds given in these theorems are not the best 
that can be obtained. For the case when j3 is small, the 
submodular reward is weighted higher and the bound is 
closer to that of maximizing a submodular function. On the 
other hand, for the case of large f3 (specifically when j3 = 1), 
the problem is just the traditional minimum cost TSP. In [18] 
an approximation of \{OPT + WORST) was given for 
finding a minimum TSP using a greedy approach which is 
better than the \OPT+ jWORST that we calculate for the 
greedy tour with costs. In addition, simple fast methods also 
exist to find the minimum cost tour in a graph using other 
approaches. Therefore, if the costs are to be given a higher 
weight, the analysis given here is not very informative of the 
resulting solution. 

VIII. Simulations 

In order to empirically compare our algorithms, we have 
run simulations for a function that represents coverage of an 
environment. A complete graph is generated by uniformly 
placing vertices over a rectangular region. Each edge in the 
graph is associated with a rectangle and each rectangle is 
assigned a width to represent different amounts of coverage. 

An example of a ten vertex graph is given in Figure 2a 
Here we see the complete graph as well as a representation 
of the value of each edge given by the area of the rectangle 
the edge corresponds to. The majority of edges have a low 
weight with a few having a much larger value. Running the 



LI greedy tour algorithm, we get the tour given in Figure 2b 



The simulations were performed on a quad-core machine 
with a 3.10 GHz CPU and 6GB RAM. To decrease total 
runtime, three instances of problems were run in parallel on 
different Matlab® sessions. 

A. Algorithm Comparison 

Next we compare the performance of the algorithms given 
in this paper. Each algorithm was run on randomly generated 
graphs for a fixed number of vertices. The resulting value of 
the objective function was recorded and averaged over all 
instances. The algorithms compared are: 
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Fig. 3. (Top) The bars give the range of results. The white markers inside 
the bars show the mean and standard deviation. (Bottom) Number of wins 
for each algorithm. Wins include ties (unique wins specified in parens). 



• GreedyTour (GT): The greedy algorithm for construct- 
ing a tour. 

• RandomTour (RT): Edges are considered in a random 
order. An edge is selected to be part of the tour as long 
as the degree constraints will be satisfied and no subtour 
will be created. 

• For the 2-matching based algorithm, three possibilities 
are considered. All three start off by greedily construct- 
ing a 2-matching. 

- GreedyMatching (GM): Remove from each subtour 
the element that will result in the least loss to the 
total value. Greedily connect up the complete tour. 

- GreedyMatching2 (GM2): Use Algorithm [6] to re- 
duce the matching. Greedily connect up the com- 
plete tour. 

- GreedyMatching3 (GM3): Use Algorithm [6] to 
reduce the matching. Arbitrarily connect up the 
complete tour. 

The vertices were distributed randomly over a 100x100 
region. For the first simulation, the edge thickness was 
assigned a value of 7 with probability ^7=' or 1 otherwise. 

This way, 0(|V|) of the edges had a high reward. A 
total of 30 different instances of the problem were solved 
by all the algorithms for 5 different graph sizes. For the 
second simulation, the set up was the same except the edges 
thickness were distributed uniformly over [0, 7] and a total 
of 40 instances were averaged. 

Average runtimes of each algorithm are shown in Figure |4] 
The results for the first setup are shown in Figure [3] The 
table gives a count of the number of times each algorithm 
had the largest value. Average runtimes of each algorithm 
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Fig. 4. Average runtime (seconds) of algorithms. 

are shown in Figure [4] For the second setup, the solution 
values and number of wins are given in Figure [5] 

The RT algorithm performs poorly in each case. This is 
expected as no effort is put into finding good edges. For the 
first setup, most of the edges have a low reward and so the 
rewards of any random set of edges will be biased towards 
a small value. In the second setup, since the distribution 
is uniform, the expected value of the tour increases though 
getting close to the "best" tour is still not likely. 

Both GT and GM perform similarly well on average (note 
that the number of ties is high especially for smaller graph 
sizes) though GM takes a lot longer to run. This can be 
explained due to the extra oracle calls required to determine 
which edge to remove from each subtour (a total of |V| extra 
oracle calls). 

Both GM2 and GM3 are slightly behind GM in terms of 
the final value. This makes sense since in GM more effort is 
put into finding a good reduction of the 2-matching whereas 
in GM2 only an attempt to find a good set of edges to remove 
is made. Between GM2 and GM3, the solution values are 
very close though GM2 runs slightly slower than GM3. The 
only difference between the two techniques is that the joining 
of the reduced subtours is performed randomly for GM3. 
This requires no oracle calls leading to a faster runtime. In 
this particular set up, the number of subtours was o(| V|) so 
very few calculations were needed to construct the final tour 
from the reduced subtours. It is however possible for the 
number of subtours to be 0(|V|) and in those cases GM2 
would be much slower as the problem size would not be 
significantly reduced by first coming up with a 2-matching. 

B. Dependence on Curvature 

To illustrate how the results of the 2-matching based 
algorithm changes with curvature, the values of the greedy 
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Fig. 5. (Top) The bars give the range of results. The white markers inside 
the bars show the mean and standard deviation. (Bottom) Number of wins 
for each algorithm. Wins include ties (unique wins specified in parens). 



matching and the linear approximation are now compared. 
The submodular function is modified to be 

fnew(S) = f(S) + ^ length(e). 

Since f(S) is the total area, its value depends on the 
thickness of the edges. For a small thickness, the overlaps 
in the area between different edges will be small and so the 
function will be more linear. Therefore, there is a positive 
correlation between the edge thickness and the curvature of 
the function. 

The experiment is performed on a ten vertex graph. The 
two 2-matching approximations are compared and the results 
are shown in Figure [6] For a second test, the GreedyTour 
and the 2-matching algorithm are compared and the average 
for 20 different ten vertex graphs is shown in Figure [7] In 
the figure, the "LGmatching" algorithm creates a greedy 2- 
matching as well as the linear approximation and takes the 
best of the two. The best edge from each subtour is removed 
and the tour is then constructed greedily. The "Lmatching" 
algorithm runs only the linear approximation to find the 2- 
matching. Figure [8] shows how the curvature changes with 
edge thickness. 

From Figure [6] we can see that for the case where the 
function is completely linear (thickness is 0), the linear 
approximation does better (since it is actually finding the 
optimal). The greedy matching starts to perform better at a 
curvature of around 0.53. Looking at the results for the value 
of the actual tour (Figure |7J, we can see that at low values 
of curvature the linear approximation is being used to create 
the tour. Eventually, greedily constructing the 2-matching 
becomes more rewarding and so the linear approximation 





Fig. 6. Curvature comparison of 2-matching algorithms 




Fig. 7. Curvature comparison of final tour 



is disregarded. Generally, over all the values of curvature 
tested, the matching algorithm performs close to or better 
that the greedy tour algorithm. 

IX. Conclusions and Future Directions 

In this paper, we extended the max-TSP problem 
to submodular rewards. We presented two algorithms; 
a greedy algorithm which achieves a ^7 approxima- 
tion, and a matching-based algorithm, which achieves a 
max{ 3 ^ 2 ^_ K j , |(1 — k)} approximation (where k is the curva- 
ture of the function). Both algorithms have a complexity of 
0(|y| 3 ) in terms of number of oracle calls. We extended 
these results to directed graphs and presented simulation 
results to empirically compare their performance as well as 
evaluating the dependence on curvature. 

There are several directions for future work. First, we 
would like to determine the tightness of the bounds that 
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Fig. 8. Curvature as a function of edge thickness 

were presented. The class of submodular functions is very 
broad and so adding further restrictions may help give a 
better idea of how the bounds change for specific situa- 
tions. Another direction of research would be considering 
extending other algorithms. The strategies presented in this 
paper are extensions of simple algorithms that are used to 
obtain approximations for the traditional TSP. There are 
many other simple strategies that could also be extended such 
as best neighbour or insertion heuristics. One other possible 
extension would be to consider the case where multiple tours 
are needed (such as with multiple patrolling robots). 
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